Search CORE

5 research outputs found

Performance Analysis of the SHA-3 Candidates on Exotic Multi-core Architectures

Author: D. Patterson
D.A. Osvik
H.P. Hofstee
J. Daemen
J.W. Bos
M. Bellare
M. Stevens
O. Takahashi
R. Benadjila
R. Szerwinski
S. Marechal
S.A. Manavski
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

The NIST hash function competition to design a new cryptographic hash standard 'SHA-3' is currently one of the hot topics in cryptologic research, its outcome heavily depends on the public evaluation of the remaining 14 candidates. There have been several cryptanalytic efforts to evaluate the security of these hash functions. Concurrently, invaluable benchmarking efforts have been made to measure the performance of the candidates on multiple architectures. In this paper we contribute to the latter; we evaluate the performance of all second-round SHA-3 candidates on two exotic platforms: the Cell Broadband Engine (Cell) and the NVIDIA Graphics Processing Units (GPUs). Firstly, we give performance estimates for each candidate based on the number of arithmetic instructions, which can be used as a starting point for evaluating the performance of the SHA-3 candidates on various platforms. Secondly, we use these generic estimates and Cell-/GPU-specific optimization techniques to give more precise figures for our target platforms, and finally, we present implementation results of all 10 non-AES based SHA-3 candidates

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Automatic Parameter Optimization for Edit Distance Algorithm on GPU

Author: S.A. Manavski
Y. Liu
Y. Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Crossref

(2009)" GPU Parallelization of Algebraic Dynamic Programming

Author: A.V. Aho
D. Mathews
D.B. Searls
J. Reeder
M.C. Schatz
R. Durbin
R. Giegerich
R. Nussinov
S.A. Manavski
W. Liu
Y. Liu
Publication venue
Publication date: 13/09/2009
Field of study

Abstract. Algebraic Dynamic Programming (ADP) is a framework to encode a broad range of optimization problems, including common bioinformatics problems like RNA folding or pairwise sequence alignment. The ADP compiler translates such ADP programs into C. As all the ADP problems have similar data dependencies in the dynamic programming tables, a generic parallelization is possible. We updated the compiler to include a parallel backend, launching a large number of independent threads. Depending on the application, we report speedups ranging from 6.1 × to 25.8 × on a Nvidia GTX 280 through the CUDA libraries.

CiteSeerX

HAL - Lille 3

Crossref

INRIA a CCSD electronic archive server

Noh performance of Yumi Yawata

Author: A. Biryukov
A. Sloss
D. Blythe
D. Seal
D.J. Bernstein
E. Biham
E. Käsper
H.P. Hofstee
J. Daemen
J. Owens
J. Yang
K. Atasu
M. Feldhofer
O. Takahashi
S. Tillich
S.A. Manavski
Publication venue
Publication date: 01/01/2010
Field of study

Paper and PresentationThis paper presents new software speed records for AES-128 encryption for architectures at both ends of the performance spectrum. On the one side we target the low-end 8-bit AVR microcontrollers and 32-bit ARM microprocessors, while on the other side of the spectrum we consider the high-performing Cell broadband engine and NVIDIA graphics processing units (GPUs). Platform specifi c techniques are detailed, explaining how the software speed records on these architectures are obtained. Additionally, this paper presents the first AES decryption implementation for GPU architectures

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Global Performing Arts Database

Calhoun, Institutional Archive of the Naval Postgraduate School

Parallel Shortest Lattice Vector Enumeration on Graphics Cards

In this paper we present an algorithm for parallel exhaustive search for short vectors in lattices. This algorithm can be applied to a wide range of parallel computing systems. To illustrate the algorithm, it was implemented on graphics cards using CUDA, a programming framework for NVIDIA graphics cards. We gain large speedups compared to previous serial CPU implementations. Our implementation is almost 5 times faster in high lattice dimensions. Exhaustive search is one of the main building blocks for lattice basis reduction in cryptanalysis. Our work results in an advance in practical lattice reduction

CiteSeerX

TUbiblio

Crossref

Cryptology ePrint Archive